IN THE CLAIMS:

1. (Currently Amended) A method of detecting speech in an incoming signal comprising
the steps of: 

receiving said incoming signal, extracting an estimate of the noise background of the 
incoming signal and suppressing the noise background of the incoming signal to provide a noise 
suppressed signal in which the estimated background noise has been removed, filtering the noise 
suppressed signal in which the background noise has been removed with a spectral inverse filter, said 
spectral inverse filter is determined by spectrum maxima and the inverse filtering operation 
comprising the steps of: 

in the logarithmic (dB) domain, removing the mean spectral magnitude from the original 
speech spectrum, 

in the mean removed short term frequency spectrum S(i), (i=1 . . . 128), determining all the
frequency position (Pj), whose magnitudes are maxima over a window centered around Pj and 
stretching N positions to the left and right of Pj, 

in the list of peaks, adding the first (i=1) and last (i=128) frequency positions, their associated
magnitudes set equal to the mean of the first and last M x N magnitudes, respectively, wherein said 
M and N are preset constants, 

removing the mean of the peak magnitudes from each peak magnitude, 

if the largest resulting peak magnitude exceeds a predetermined maximum peak value 
MAX_dB_DN, normalizing all peaks so that the largest peaks magnitude becomes MAX_dB_DN,
and 

the resulting inverse filtering H(i), (i=1 . . . 128) is defined as the maximum of the normalized
peaks and 0 dB, and

removing the inverse filter from the original spectrum in the logarithmic domain U(i)= S(i)- 
H(i) and measuring the periodicity of the signal from the inverse filter using an autocorrelation 
function to determine whether a signal frame corresponds to a speech frame or not.
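The inverse filtering steps recited in claim 1 can be sketched roughly as follows. This is a minimal illustration, not the claimed implementation: the function name `inverse_filter`, the clipping of the search window at the spectrum edges, the multiplicative form of the normalization, and the linear interpolation between peaks to obtain H(i) at every bin are all assumptions that the claim text does not specify. The defaults N=3, M=5 and MAX_dB_DN=3.5 dB are the values recited in claims 12 and 13.

```python
def inverse_filter(log_spectrum_db, n=3, m=5, max_db_dn=3.5):
    """Return (H, U) for one frame of a log-magnitude spectrum in dB."""
    size = len(log_spectrum_db)
    mean = sum(log_spectrum_db) / size
    s = [v - mean for v in log_spectrum_db]  # mean-removed spectrum S(i)

    # Local maxima over a window of n bins on each side of the candidate.
    # Clipping the window at the spectrum edges is an assumption.
    peaks = {}
    for p in range(1, size - 1):
        window = s[max(0, p - n):p + n + 1]
        if s[p] == max(window):
            peaks[p] = s[p]

    # Add the first and last positions, with magnitudes set to the mean of
    # the first and last m*n magnitudes respectively.
    k = m * n
    peaks[0] = sum(s[:k]) / k
    peaks[size - 1] = sum(s[-k:]) / k

    # Remove the mean of the peak magnitudes from each peak magnitude.
    positions = sorted(peaks)
    peak_mean = sum(peaks.values()) / len(peaks)
    mags = [peaks[p] - peak_mean for p in positions]

    # Normalize so the largest peak equals max_db_dn (scaling is an assumption).
    largest = max(mags)
    if largest > max_db_dn:
        scale = max_db_dn / largest
        mags = [v * scale for v in mags]

    # H(i): linear interpolation between peaks (assumption), floored at 0 dB.
    h = []
    j = 0
    for i in range(size):
        while positions[j + 1] < i:
            j += 1
        left, right = positions[j], positions[j + 1]
        t = (i - left) / (right - left)
        h.append(max(mags[j] + t * (mags[j + 1] - mags[j]), 0.0))

    u = [s[i] - h[i] for i in range(size)]  # U(i) = S(i) - H(i)
    return h, u
```

Under these assumptions, H(i) never drops below 0 dB and never exceeds MAX_dB_DN, so the subtraction only attenuates the spectral envelope around the detected maxima.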

2. (Original) The method of claim 1 wherein said periodicity measurement is defined as: 

$$p = \max_{T_l \le \tau \le T_h} R_x(\tau)$$

where T_l and T_h are pre-specified so that the period will range in the range of speech and the signal is
speech if p is above a given threshold.
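As a rough illustration of the periodicity measurement of claim 2, the sketch below takes the maximum of a normalized autocorrelation over a range of lags bounded below and above. Several things here are assumptions rather than claim text: the function name `periodicity`, the 8000 Hz sample rate, the computation of the autocorrelation from a one-sided linear power spectrum by a cosine transform (the Wiener-Khinchin relation), and the normalization by the zero-lag value. The 75 Hz and 400 Hz bounds come from claim 3.

```python
import math

def periodicity(power_spectrum, sample_rate=8000, f_low=75.0, f_high=400.0):
    """Maximum normalized autocorrelation over the lags that span speech pitch."""
    bins = len(power_spectrum)          # one-sided spectrum of a 2*bins-point frame
    fft_size = 2 * bins
    t_low = int(sample_rate / f_high)   # shortest period, in samples
    t_high = int(sample_rate / f_low)   # longest period, in samples
    r0 = sum(power_spectrum)            # zero-lag value used for normalization

    def autocorr(tau):
        # Cosine transform of the power spectrum at lag tau.
        total = sum(pw * math.cos(2 * math.pi * k * tau / fft_size)
                    for k, pw in enumerate(power_spectrum))
        return total / r0

    return max(autocorr(tau) for tau in range(t_low, t_high + 1))
```

A frame would then be labeled speech when the returned value exceeds a chosen threshold. A spectrum in dB, such as U(i), would first need converting to linear power, for example with `10 ** (u / 10)`.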

3. (Original) The method of Claim 2 wherein said period is between about 75 Hz and 400 Hz.

4. (Previously Presented) The method of claim 2 where said threshold value is set to 
maximize speech detection accuracy. 

5. (Original) The method of claim 1 wherein said extracting step includes the steps of: 
converting the spectrum of the incoming signal into logarithmic domain, 

removing high frequency components in logarithmic domain by recurrent filtering along the 
time axis, 

establishing an estimate of noise background, converting the estimate into linear domain, and 
suppressing the noise background from the signal, in linear domain. 
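The extracting and suppressing steps of claim 5 might look like the following sketch. It is hedged throughout: the function name `suppress_noise`, the first-order recursive smoother with coefficient `alpha`, the use of the smoothed log spectrum directly as the noise estimate, and the spectral floor `floor` are all assumptions chosen for illustration; the claim only requires log-domain recurrent filtering along the time axis followed by suppression in the linear domain.

```python
import math

def suppress_noise(frames, alpha=0.95, floor=1e-3):
    """frames: list of linear power spectra, one list of bins per time frame."""
    tiny = 1e-12  # guards against log(0)
    smoothed = [math.log(max(v, tiny)) for v in frames[0]]
    output = []
    for frame in frames:
        # Convert the spectrum to the logarithmic domain.
        log_frame = [math.log(max(v, tiny)) for v in frame]
        # Recursive low-pass filter along time, per frequency bin.
        smoothed = [alpha * s + (1.0 - alpha) * x
                    for s, x in zip(smoothed, log_frame)]
        # Convert the noise estimate back to the linear domain.
        noise = [math.exp(s) for s in smoothed]
        # Subtract in the linear domain, keeping a small spectral floor.
        output.append([max(x - nz, floor * x) for x, nz in zip(frame, noise)])
    return output
```

With a stationary background the smoothed estimate tracks the input closely and the output collapses to the floor, while a sudden burst of energy passes through almost unchanged because the slow filter has not yet adapted to it.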

6. (Canceled)

7. (Previously Presented) The method of Claim 10 wherein said inverse filtering is based
on a normalized approximation of the envelope of the short term speech spectrum derived from a 
local maxima of the short term speech spectrum. 

8. (Previously Presented) The method of claim 7 wherein said inverse filtering is 
performed in a log frequency domain and is implemented by subtracting from the original spectrum 
the estimated inverse filtering spectrum. 

9. (Canceled) 

10. (Previously Presented) A noise-resistant utterance detector comprising the steps of: 
accepting a speech utterance input signal, 

removing background noise from the utterance signal according to a spectral subtraction 
method to get a noise subtracted signal, 

filtering the noise subtracted signal with a spectral inverse filter to get an inverse filtered
signal,

locating close low-frequency formants in the noise subtracted signal if they exist and 
inserting spectral valleys between said formants before inverse filtering, 

calculating the autocorrelation from the inverse filtered signal to get an autocorrelation result,
and

detecting that a frame of the signal being processed is or is not speech based on a threshold 
applied to the autocorrelation result. 

11. (Currently Amended) The method of claim 10 wherein said spectral inverse filter is
determined by spectrum maxima and the inverse filtering operation by the steps of: 

in the logarithmic (dB) domain, removing the mean spectral magnitude from the original 
speech spectrum,

in the mean removed short term frequency spectrum S(i), (i=1 . . . 128), determining all the
frequency position (Pj), whose magnitudes are maxima over a window centered around Pj and 
stretching N positions to the left and right of Pj, 

in the list of peaks, adding the first (i=1) and last (i=128) frequency positions, their associated
magnitudes set equal to the mean of the first and last M x N magnitudes, respectively, wherein said 
M and N are preset constants, 

removing the mean of the peak magnitudes from each peak magnitude, 

if the largest resulting peak magnitude exceeds a predetermined maximum peak value 
MAX_dB_DN, normalizing all peaks so that the largest peaks magnitude becomes MAX_dB_DN, 

the resulting inverse filtering H(i), (i=1 . . . 128) is defined as the maximum of the normalized
peaks and 0 dB, and 

removing the inverse filter from the original spectrum in the logarithmic domain U(i) = S(i) -
H(i).

12. (New) The method of claim 11 wherein said M, N and MAX_dB_DN are pre-selected
to have the following values: M=5, N=3 and MAX_dB_DN=3.5 dB.

13. (New) The method of claim 1 wherein said M, N and MAX_dB_DN are pre-selected to
have the following values: M=5, N=3 and MAX_dB_DN=3.5 dB. 